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AMENDMENTS TO THE CLAIMS 

This listing of claims will replace all prior versions, and listings, of claims in the 
appHcation: 

1 1 . (Previously Presented) Method for detecting voice activity in a digital speech signal in at 

2 least one frequency band, wherein the voice activity is detected on the basis of an analysis 

3 comprising the step of comparing two different versions of the speech signal, wherein at least 

4 one of said versions is a denoised version obtained by taking account of estimates of noise 

5 included in the signal. 

1 2. (Previously Presented) Method according to claim 1, wherein said comparison is 

2 performed on respective energies, evaluated in said frequency band, of the two different versions 

3 of the speech signal, or to a monotonic ftmction of said energies. 

1 3. (Currently Amended) Method according to claim 1, wherein said analysis fiirther 

2 comprises a time smoothing of the energy of one of said versions of the speech signal, and a 

3 comparison between the energy of said version and the smooth smoothed energy. 

1 4. (Currently Amended) Method according to claim 3, wherein the comparison between the 

2 energy of said version and the smooth smoothed energy controls transitions of a voice activity 

3 detection automaton from a speech state to a silence state, and wherein the comparison of the 

4 two different versions of the speech signal controls transitions of the detection automaton from 

5 the silence state to the speech state. 

1 5. (Previously Presented) Method according to claim 1, wherein the two different versions 

2 of the speech signal are two versions denoised by non-Unear spectral subtraction, wherein a first 

3 of the two versions is denoised in such a way as not to be less, in the spectral domain, than a first 

4 fraction of a long-term estimate representative of a noise component included in the speech 

5 signal, and the second of the two versions is denoised in such a way as not to be less, in the 
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6 spectral domain, than a second fraction of said long-term estimate, smaller than said first 

7 fraction. 

1 6. (Previously Presented) Method according to claim 5, wherein said analysis further 

2 comprises a time smoothing of the energy of each of the two versions of the speech signal, by 

3 means of a smoothing window determined by comparing the energy of the second of the two 

4 versions with the smoothed energy of the second of the two versions. 

1 7. (Previously Presented) Method according to claim 6, wherein the smoothing window is 

2 an exponential window defined by a forgetting factor. 

1 8. (Previously Presented) Method according to claim 7, comprising the step of allocating a 

2 substantially zero value to the forgetting factor when the energy of the second of the two 

3 versions is less than a value of the order of the smoothed energy of the second of the two 

4 versions. 

1 9. (Currently Amended) Method according to claim 8, comprising the step of allocating a 

2 first value substantially equal to 1 to the forgetting factor when the energy of the second of the 

3 two versions is greater than said value of the order of the smooth smoothed energy multiplied by 

4 a coefficient bigger than 1, and allocating a second value lying between 0 and said first value to 

5 the forgetting factor when the energy of the second of the two versions is greater than said value 

6 of the order of the smoothed energy and less than said value of the order of the smoothed energy 

7 multiplied by said coefficient. 

1 10. (Currently Amended) Method according to claim 4- 5, wherein the first and second 

2 fractions correspond substantially to attenuations of 10 dB and 60 dB, respectively. 

1 11. (Previously Presented) Method according to claim 1, wherein the comparison of the two 

2 different versions of the speech signal is performed on respective differences between the 
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3 energies of said two versions in said frequency band and a lower bound of the energy of the 

4 denoised version of the speech signal in said frequency band. 

1 12. (Previously Presented) Method according to claim 11, wherein one of the two different 

2 versions of the speech signal is a non-denoised version of the speech signal. 

1 13. (Previously Presented) Device for detecting voice activity in a speech signal, comprising 

2 signal processing means for analyzing the speech signal in at least one frequency band, wherein 

3 the processing means comprise means for comparing two different versions of the speech signal, 

4 wherein at least one of said versions is a denoised version, obtained by taking account of 

5 estimates of noise included in the signal. 

1 14.-15. (Cancelled) 

1 16. (Previously Presented) Device according to claim 13, wherein the processing means 

2 comprise means for evaluating, in said frequency band, energies of said two different versions of 

3 the speech signal, whereby inputs of the comparison means comprise said energies or a 

4 monotonic fimction of said energies. 

1 17. (Previously Presented) Device according to claim 13, wherein the processing means 

2 fiirther comprises means for performing a time smoothing of the energy of one of said versions 

3 of the speech signal, and means for comparing the energy of said version and the smoothed 

4 energy. 

1 18. (Previously Presented) Device according to claim 17, wherein the processing means 

2 comprise a voice activity detection automaton having a pluraUty of states including a speech 

3 state and a silence state, means for controlling transitions of the voice activity detection 

4 automaton from the speech state to the silence state based on a comparison between the energy 

5 of said version and the smoothed energy, and means for controlling transitions of the voice 
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6 activity detection automaton from the silence state to the speech state based on a comparison of 

7 the two different versions of the speech signal. 

1 19. (Previously Presented) Device according to claim 13, further comprising first non-linear 

2 spectral subtraction means to provide a first of the two versions of the speech signal as a 

3 denoised version which is not less, in the spectral domain, than a first fraction of a long-term 

4 estimate representative of a noise component included in the speech signal, and second non- 

5 linear spectral subtraction means to provide a second of the two versions of the speech signal as 

6 a denoised version which is not less, in the spectral domain, than a second fraction of said long- 

7 term estimate, said second fraction being smaller than said first fraction. 

1 20. (Currently Amended) Device according to claim 19, wherein the processing means 

2 further comprises means for performing a time smoothing of the energy of each of the two 

3 versions of the speech signal, by means of a smoothing window determined by comparing an 

4 energy of the second of the two versions with the smoothing smoothed energy of the second of 

5 the two versions. 

1 21. (Previously Presented) Device according to claim 20, wherein the smoothing window is 

2 an exponential window defined by a forgetting factor. 

1 22. (Previously Presented) Device according to claim 21, wherein the processing means 

2 further comprises means for allocating a substantially zero value to the forgetting factor when the 

3 energy of the second of the two versions is less than a value of the order of the smoothed energy 

4 of the second of the two versions. 

1 23. (Currently Amended) Device according to claim 22, wherein the processing means 

2 further comprises means for allocating a first value substantially equal to 1 to the forgetting 

3 factor when the energy of the second of the two versions is greater than said value of the order of 

4 the smoothed energy multiplied by a coefficient bigger than 1, and for allocating a second value 

5 lying between 0 and said first value to the forgetting factor when the energy of the second of the 
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6 two versions is greater than said value of the order of the smoothed energy and less than said 

7 value of the order of the smooth smoothed energy multiplied by said coefficient. 

1 24. (Currently Amended) Device according to claim 19 44, wherein the first and second 

2 fractions correspond substantially to attenuations of 10 dB and 60 dB, respectively. 

1 25. (Previously Presented) Device according to claim 13, wherein the comparison of the two 

2 different versions of the speech signal is performed on respective differences between the 

3 energies of said two versions in said frequency band and a lower bound of the energy of the 

4 denoised version of the speech signal in said frequency band. 

1 26. (Previously Presented) Device according to claim 25, wherein one of the two different 

2 versions of the speech signal is a non-denoised version of the speech signal. 

1 27. (Previously Presented) A computer program product, loadable into a memory associated 

2 with a processor, and comprising portions of code for execution by the processor to detect voice 

3 activity in an input digital speech signal in at least one frequency band, whereby the voice 

4 activity is detected on the basis of an analysis comprising the step of comparing two different 

5 versions of the speech signal, wherein at least one of said versions is a denoised version obtained 

6 by taking account of estimates of noise included in the signal. 

1 28. (Previously Presented) A computer program product according to claim 27, wherein said 

2 comparison is performed on respective energies, evaluated in said frequency band, of the two 

3 different versions of the speech signal, or to a monotonic function of said energies. 

1 29. (Currently Amended) A computer program product according to claim 27 4-, wherein 

2 said analysis further comprises a time smoothing of the energy of one of said versions of the 

3 speech signal, and a comparison between the energy of said version and ef the smoothed energy. 



7 



Appln, Serial No. 09/806,756 

Amendment Dated November 2, 2004 

Reply to Office Action Mailed August 4, 2004 



1 30. (Previously Presented) A computer program product according to claim 29, wherein the 

2 comparison between the energy of said version and the smoothed energy controls transitions of a 

3 voice activity detection automaton from a speech state to a silence state, and wherein the 

4 comparison of the two different versions of the speech signal controls transitions of the detection 

5 automaton from the silence state to the speech state. 

1 31. (Previously Presented) A computer program product according to claim 27, wherein the 

2 two different versions of the speech signal are two versions denoised by non-linear spectral 

3 subtraction, wherein a first of the two versions is denoised in such a way as not to be less, in the 

4 spectral domain, than a first fraction of a long-term estimate representative of a noise component 

5 included in the speech signal, and the second of the two versions is denoised in such a way as not 

6 to be less, in the spectral domain, than a second fraction of said long-term estimate, smaller than 

7 said first fraction. 

1 32. (Previously Presented) A computer program product according to claim 31, wherein said 

2 analysis fiirther comprises a time smoothing of the energy of each of the two versions of the 

3 speech signal, by means of a smoothing window determined by comparing the energy of the 

4 second of the two versions with the smoothed energy of the second of the two versions. 

1 33. (Previously Presented) A computer program product according to claim 32, wherein the 

2 smoothing window is an exponential window defined by a forgetting factor. 

1 34. (Previously Presented) A computer program product according to claim 33, wherein said 

2 analysis fixrther comprises the step of allocating a substantially zero value to the forgetting 

3 factor when the energy of the second of the two versions is less than a value of the order of the 

4 smoothed energy of the second of the two versions. 

1 35. (Previously Presented) A computer program product according to claim 34, wherein said 

2 analysis fiirther comprises the steps of allocating a first value substantially equal to 1 to the 

3 forgetting factor when the energy of the second of the two versions is greater than said value of 
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4 the order of the smoothed energy multipUed by a coefficient bigger than 1, and allocating a 

5 second value lying between 0 and said first value to the forgetting factor v^hen the energy of the 

6 second of the two versions is greater than said value of the order of the smoothed energy and less 

7 than said value of the order of the smoothed energy multiplied by said coefficient. 

1 36. (Currently Amended) A computer program product according to claim 3i 27, wherein 

2 the first and second fractions correspond substantially to attenuations of 10 dB and 60 dB, 

3 respectively. 

1 37. (Previously Presented) A computer program product according to claim 27, wherein the 

2 comparison of the two different versions of the speech signal is performed on respective 

3 differences between the energies of said two versions in said frequency band and a lower bound 

4 of the energy of the denoised version of the speech signal in said frequency band. 

1 38. (Previously Presented) A computer program product according to claim 37, wherein one 

2 of the two different versions of the speech signal is a non-denoised version of the speech signal. 
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